Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 6095 |
| Missing cells | 11353 |
| Missing cells (%) | 8.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 8.7 MiB |
| Average record size in memory | 1.5 KiB |
Variable types
| Text | 17 |
|---|---|
| Numeric | 2 |
| Categorical | 3 |
Year is highly overall correlated with original order | High correlation |
original order is highly overall correlated with Year | High correlation |
Sex is highly imbalanced (80.2%) | Imbalance |
Fatal (Y/N) is highly imbalanced (69.8%) | Imbalance |
Area has 413 (6.8%) missing values | Missing |
Location has 512 (8.4%) missing values | Missing |
Activity has 536 (8.8%) missing values | Missing |
Name has 207 (3.4%) missing values | Missing |
Sex has 578 (9.5%) missing values | Missing |
Age has 2721 (44.6%) missing values | Missing |
Time has 3247 (53.3%) missing values | Missing |
Species has 2996 (49.2%) missing values | Missing |
original order is uniformly distributed | Uniform |
Year has 124 (2.0%) zeros | Zeros |
Reproduction
| Analysis started | 2023-11-25 14:11:05.678150 |
|---|---|
| Analysis finished | 2023-11-25 14:11:19.658361 |
| Duration | 13.98 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
Case Number
Text
| Distinct | 6078 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 402.5 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 10 |
| Mean length | 10.613718 |
| Min length | 6 |
Characters and Unicode
| Total characters | 64680 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6062 ? |
|---|---|
| Unique (%) | 99.5% |
Sample
| 1st row | 2017.06.11 |
|---|---|
| 2nd row | 2017.06.10.b |
| 3rd row | 2017.06.10.a |
| 4th row | 2017.06.07.R |
| 5th row | 2017.06.04 |
| Value | Count | Frequency (%) |
| 1923.00.00.a | 2 | < 0.1% |
| 1990.05.10 | 2 | < 0.1% |
| 2 | < 0.1% | |
| b | 2 | < 0.1% |
| 2009.12.18 | 2 | < 0.1% |
| 1954.00.00 | 2 | < 0.1% |
| 2013.10.05 | 2 | < 0.1% |
| 2014.08.02 | 2 | < 0.1% |
| 1915.07.06.a.r | 2 | < 0.1% |
| 2006.09.02 | 2 | < 0.1% |
| Other values (6071) | 6081 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 14088 | |
| 0 | 12992 | |
| 1 | 10237 | |
| 2 | 5887 | |
| 9 | 5798 | |
| 8 | 2706 | 4.2% |
| 6 | 2379 | 3.7% |
| 7 | 2150 | 3.3% |
| 5 | 2112 | 3.3% |
| 3 | 2066 | 3.2% |
| Other values (24) | 4265 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 48262 | |
| Other Punctuation | 14092 | 21.8% |
| Lowercase Letter | 1526 | 2.4% |
| Uppercase Letter | 757 | 1.2% |
| Dash Punctuation | 33 | 0.1% |
| Space Separator | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 638 | |
| b | 623 | |
| c | 131 | 8.6% |
| d | 52 | 3.4% |
| e | 29 | 1.9% |
| f | 17 | 1.1% |
| g | 11 | 0.7% |
| h | 8 | 0.5% |
| j | 5 | 0.3% |
| i | 5 | 0.3% |
| Other values (5) | 7 | 0.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12992 | |
| 1 | 10237 | |
| 2 | 5887 | |
| 9 | 5798 | |
| 8 | 2706 | 5.6% |
| 6 | 2379 | 4.9% |
| 7 | 2150 | 4.5% |
| 5 | 2112 | 4.4% |
| 3 | 2066 | 4.3% |
| 4 | 1935 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 14088 | |
| & | 2 | < 0.1% |
| , | 1 | < 0.1% |
| / | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 519 | |
| D | 119 | 15.7% |
| N | 119 | 15.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 33 |
Space Separator
| Value | Count | Frequency (%) |
| 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 62397 | |
| Latin | 2283 | 3.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 638 | |
| b | 623 | |
| R | 519 | |
| c | 131 | 5.7% |
| D | 119 | 5.2% |
| N | 119 | 5.2% |
| d | 52 | 2.3% |
| e | 29 | 1.3% |
| f | 17 | 0.7% |
| g | 11 | 0.5% |
| Other values (8) | 25 | 1.1% |
Common
| Value | Count | Frequency (%) |
| . | 14088 | |
| 0 | 12992 | |
| 1 | 10237 | |
| 2 | 5887 | |
| 9 | 5798 | |
| 8 | 2706 | 4.3% |
| 6 | 2379 | 3.8% |
| 7 | 2150 | 3.4% |
| 5 | 2112 | 3.4% |
| 3 | 2066 | 3.3% |
| Other values (6) | 1982 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 64680 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 14088 | |
| 0 | 12992 | |
| 1 | 10237 | |
| 2 | 5887 | |
| 9 | 5798 | |
| 8 | 2706 | 4.2% |
| 6 | 2379 | 3.7% |
| 7 | 2150 | 3.3% |
| 5 | 2112 | 3.3% |
| 3 | 2066 | 3.2% |
| Other values (24) | 4265 | 6.6% |
Date
Text
| Distinct | 5197 |
|---|---|
| Distinct (%) | 85.3% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 405.3 KiB |
Length
| Max length | 64 |
|---|---|
| Median length | 10 |
| Mean length | 11.072202 |
| Min length | 5 |
Characters and Unicode
| Total characters | 67474 |
|---|---|
| Distinct characters | 61 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4523 ? |
|---|---|
| Unique (%) | 74.2% |
Sample
| 1st row | 2017-06-11 |
|---|---|
| 2nd row | 2017-06-10 |
| 3rd row | 2017-06-10 |
| 4th row | Reported 07-Jun-2017 |
| 5th row | 2017-06-04 |
| Value | Count | Frequency (%) |
| reported | 513 | 7.3% |
| before | 85 | 1.2% |
| ca | 35 | 0.5% |
| no | 26 | 0.4% |
| date | 26 | 0.4% |
| summer | 17 | 0.2% |
| late | 15 | 0.2% |
| 13 | 0.2% | |
| early | 13 | 0.2% |
| 1905-05-10 | 11 | 0.2% |
| Other values (5238) | 6257 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 11700 | |
| 0 | 11041 | |
| 1 | 10043 | |
| 2 | 6254 | |
| 9 | 5530 | |
| 8 | 2543 | 3.8% |
| 5 | 2365 | 3.5% |
| 6 | 2334 | 3.5% |
| 3 | 2029 | 3.0% |
| 7 | 1996 | 3.0% |
| Other values (51) | 11639 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46064 | |
| Dash Punctuation | 11700 | 17.3% |
| Lowercase Letter | 6763 | 10.0% |
| Uppercase Letter | 1697 | 2.5% |
| Space Separator | 1114 | 1.7% |
| Other Punctuation | 122 | 0.2% |
| Close Punctuation | 6 | < 0.1% |
| Open Punctuation | 6 | < 0.1% |
| Control | 1 | < 0.1% |
| Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1555 | |
| r | 803 | |
| o | 735 | |
| p | 687 | |
| t | 655 | |
| d | 567 | 8.4% |
| a | 350 | 5.2% |
| u | 328 | 4.8% |
| n | 213 | 3.1% |
| l | 150 | 2.2% |
| Other values (12) | 720 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 517 | |
| J | 293 | |
| A | 174 | 10.3% |
| M | 130 | 7.7% |
| S | 118 | 7.0% |
| B | 95 | 5.6% |
| N | 90 | 5.3% |
| D | 78 | 4.6% |
| F | 57 | 3.4% |
| O | 53 | 3.1% |
| Other values (7) | 92 | 5.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 11041 | |
| 1 | 10043 | |
| 2 | 6254 | |
| 9 | 5530 | |
| 8 | 2543 | 5.5% |
| 5 | 2365 | 5.1% |
| 6 | 2334 | 5.1% |
| 3 | 2029 | 4.4% |
| 7 | 1996 | 4.3% |
| 4 | 1929 | 4.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 79 | |
| , | 21 | 17.2% |
| " | 9 | 7.4% |
| & | 7 | 5.7% |
| ? | 4 | 3.3% |
| / | 2 | 1.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11700 |
Space Separator
| Value | Count | Frequency (%) |
| 1114 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6 |
Control
| Value | Count | Frequency (%) |
| 1 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 59014 | |
| Latin | 8460 | 12.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1555 | |
| r | 803 | |
| o | 735 | 8.7% |
| p | 687 | 8.1% |
| t | 655 | 7.7% |
| d | 567 | 6.7% |
| R | 517 | 6.1% |
| a | 350 | 4.1% |
| u | 328 | 3.9% |
| J | 293 | 3.5% |
| Other values (29) | 1970 |
Common
| Value | Count | Frequency (%) |
| - | 11700 | |
| 0 | 11041 | |
| 1 | 10043 | |
| 2 | 6254 | |
| 9 | 5530 | |
| 8 | 2543 | 4.3% |
| 5 | 2365 | 4.0% |
| 6 | 2334 | 4.0% |
| 3 | 2029 | 3.4% |
| 7 | 1996 | 3.4% |
| Other values (12) | 3179 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 67474 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 11700 | |
| 0 | 11041 | |
| 1 | 10043 | |
| 2 | 6254 | |
| 9 | 5530 | |
| 8 | 2543 | 3.8% |
| 5 | 2365 | 3.5% |
| 6 | 2334 | 3.5% |
| 3 | 2029 | 3.0% |
| 7 | 1996 | 3.0% |
| Other values (51) | 11639 |
Year
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 240 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1926.1973 |
| Minimum | 0 |
|---|---|
| Maximum | 2017 |
| Zeros | 124 |
| Zeros (%) | 2.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1862 |
| Q1 | 1942 |
| median | 1976 |
| Q3 | 2004 |
| 95-th percentile | 2015 |
| Maximum | 2017 |
| Range | 2017 |
| Interquartile range (IQR) | 62 |
Descriptive statistics
| Standard deviation | 284.36642 |
|---|---|
| Coefficient of variation (CV) | 0.14763099 |
| Kurtosis | 40.644989 |
| Mean | 1926.1973 |
| Median Absolute Deviation (MAD) | 29 |
| Skewness | -6.4397286 |
| Sum | 11734394 |
| Variance | 80864.262 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2015 | 141 | 2.3% |
| 2016 | 128 | 2.1% |
| 2011 | 128 | 2.1% |
| 2014 | 126 | 2.1% |
| 0 | 124 | 2.0% |
| 2013 | 122 | 2.0% |
| 2008 | 122 | 2.0% |
| 2009 | 120 | 2.0% |
| 2012 | 117 | 1.9% |
| 2007 | 112 | 1.8% |
| Other values (230) | 4852 |
| Value | Count | Frequency (%) |
| 0 | 124 | |
| 5 | 1 | < 0.1% |
| 77 | 1 | < 0.1% |
| 500 | 1 | < 0.1% |
| 1543 | 1 | < 0.1% |
| 1554 | 1 | < 0.1% |
| 1555 | 1 | < 0.1% |
| 1580 | 1 | < 0.1% |
| 1595 | 1 | < 0.1% |
| 1617 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2017 | 54 | 0.9% |
| 2016 | 128 | |
| 2015 | 141 | |
| 2014 | 126 | |
| 2013 | 122 | |
| 2012 | 117 | |
| 2011 | 128 | |
| 2010 | 101 | |
| 2009 | 120 | |
| 2008 | 122 |
Type
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 5 |
| Missing (%) | 0.1% |
| Memory size | 395.1 KiB |
| Unprovoked | |
|---|---|
| Provoked | |
| Invalid | |
| Sea Disaster | 220 |
| Boat | 202 |
Length
| Max length | 12 |
|---|---|
| Median length | 10 |
| Mean length | 9.3735632 |
| Min length | 4 |
Characters and Unicode
| Total characters | 57085 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unprovoked |
|---|---|
| 2nd row | Unprovoked |
| 3rd row | Unprovoked |
| 4th row | Unprovoked |
| 5th row | Unprovoked |
Common Values
| Value | Count | Frequency (%) |
| Unprovoked | 4466 | |
| Provoked | 563 | 9.2% |
| Invalid | 529 | 8.7% |
| Sea Disaster | 220 | 3.6% |
| Boat | 202 | 3.3% |
| Boating | 110 | 1.8% |
| (Missing) | 5 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unprovoked | 4466 | |
| provoked | 563 | 8.9% |
| invalid | 529 | 8.4% |
| sea | 220 | 3.5% |
| disaster | 220 | 3.5% |
| boat | 202 | 3.2% |
| boating | 110 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 10370 | |
| v | 5558 | |
| d | 5558 | |
| e | 5469 | |
| r | 5249 | |
| n | 5105 | |
| k | 5029 | |
| U | 4466 | |
| p | 4466 | |
| a | 1281 | 2.2% |
| Other values (11) | 4534 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 50555 | |
| Uppercase Letter | 6310 | 11.1% |
| Space Separator | 220 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 10370 | |
| v | 5558 | |
| d | 5558 | |
| e | 5469 | |
| r | 5249 | |
| n | 5105 | |
| k | 5029 | |
| p | 4466 | |
| a | 1281 | 2.5% |
| i | 859 | 1.7% |
| Other values (4) | 1611 | 3.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4466 | |
| P | 563 | 8.9% |
| I | 529 | 8.4% |
| B | 312 | 4.9% |
| S | 220 | 3.5% |
| D | 220 | 3.5% |
Space Separator
| Value | Count | Frequency (%) |
| 220 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56865 | |
| Common | 220 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 10370 | |
| v | 5558 | |
| d | 5558 | |
| e | 5469 | |
| r | 5249 | |
| n | 5105 | |
| k | 5029 | |
| U | 4466 | |
| p | 4466 | |
| a | 1281 | 2.3% |
| Other values (10) | 4314 |
Common
| Value | Count | Frequency (%) |
| 220 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57085 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 10370 | |
| v | 5558 | |
| d | 5558 | |
| e | 5469 | |
| r | 5249 | |
| n | 5105 | |
| k | 5029 | |
| U | 4466 | |
| p | 4466 | |
| a | 1281 | 2.2% |
| Other values (11) | 4534 |
Country
Text
| Distinct | 204 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 47 |
| Missing (%) | 0.8% |
| Memory size | 380.0 KiB |
Length
| Max length | 37 |
|---|---|
| Median length | 30 |
| Mean length | 7.0656415 |
| Min length | 3 |
Characters and Unicode
| Total characters | 42733 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 81 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | AUSTRALIA |
|---|---|
| 2nd row | AUSTRALIA |
| 3rd row | USA |
| 4th row | UNITED KINGDOM |
| 5th row | USA |
| Value | Count | Frequency (%) |
| usa | 2160 | |
| australia | 1303 | |
| south | 594 | 8.0% |
| africa | 572 | 7.7% |
| new | 327 | 4.4% |
| guinea | 148 | 2.0% |
| papua | 133 | 1.8% |
| zealand | 126 | 1.7% |
| brazil | 103 | 1.4% |
| bahamas | 101 | 1.4% |
| Other values (203) | 1900 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 10069 | |
| U | 4687 | |
| S | 4682 | |
| I | 3573 | 8.4% |
| R | 2487 | 5.8% |
| T | 2349 | 5.5% |
| L | 2089 | 4.9% |
| N | 1775 | 4.2% |
| E | 1581 | 3.7% |
| 1437 | 3.4% | |
| Other values (43) | 8004 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 41197 | |
| Space Separator | 1437 | 3.4% |
| Lowercase Letter | 68 | 0.2% |
| Other Punctuation | 24 | 0.1% |
| Close Punctuation | 3 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 10069 | |
| U | 4687 | |
| S | 4682 | |
| I | 3573 | 8.7% |
| R | 2487 | 6.0% |
| T | 2349 | 5.7% |
| L | 2089 | 5.1% |
| N | 1775 | 4.3% |
| E | 1581 | 3.8% |
| O | 1344 | 3.3% |
| Other values (16) | 6561 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 12 | |
| i | 10 | |
| r | 7 | |
| s | 5 | |
| o | 5 | |
| t | 5 | |
| n | 4 | 5.9% |
| a | 4 | 5.9% |
| l | 3 | 4.4% |
| j | 3 | 4.4% |
| Other values (8) | 10 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 9 | |
| / | 7 | |
| ? | 5 | |
| . | 2 | 8.3% |
| , | 1 | 4.2% |
Space Separator
| Value | Count | Frequency (%) |
| 1437 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41265 | |
| Common | 1468 | 3.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 10069 | |
| U | 4687 | |
| S | 4682 | |
| I | 3573 | 8.7% |
| R | 2487 | 6.0% |
| T | 2349 | 5.7% |
| L | 2089 | 5.1% |
| N | 1775 | 4.3% |
| E | 1581 | 3.8% |
| O | 1344 | 3.3% |
| Other values (34) | 6629 |
Common
| Value | Count | Frequency (%) |
| 1437 | ||
| & | 9 | 0.6% |
| / | 7 | 0.5% |
| ? | 5 | 0.3% |
| ) | 3 | 0.2% |
| ( | 3 | 0.2% |
| . | 2 | 0.1% |
| - | 1 | 0.1% |
| , | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42733 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 10069 | |
| U | 4687 | |
| S | 4682 | |
| I | 3573 | 8.4% |
| R | 2487 | 5.8% |
| T | 2349 | 5.5% |
| L | 2089 | 4.9% |
| N | 1775 | 4.2% |
| E | 1581 | 3.7% |
| 1437 | 3.4% | |
| Other values (43) | 8004 |
Area
Text
MISSING 
| Distinct | 799 |
|---|---|
| Distinct (%) | 14.1% |
| Missing | 413 |
| Missing (%) | 6.8% |
| Memory size | 397.9 KiB |
Length
| Max length | 62 |
|---|---|
| Median length | 49 |
| Mean length | 12.106829 |
| Min length | 4 |
Characters and Unicode
| Total characters | 68791 |
|---|---|
| Distinct characters | 83 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 542 ? |
|---|---|
| Unique (%) | 9.5% |
Sample
| 1st row | Western Australia |
|---|---|
| 2nd row | Victoria |
| 3rd row | Florida |
| 4th row | South Devon |
| 5th row | Florida |
| Value | Count | Frequency (%) |
| florida | 1017 | 10.3% |
| south | 814 | 8.2% |
| province | 652 | 6.6% |
| new | 618 | 6.2% |
| wales | 476 | 4.8% |
| western | 391 | 3.9% |
| cape | 352 | 3.5% |
| queensland | 308 | 3.1% |
| hawaii | 295 | 3.0% |
| california | 292 | 2.9% |
| Other values (842) | 4703 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8247 | 12.0% |
| e | 5329 | 7.7% |
| r | 4927 | 7.2% |
| i | 4885 | 7.1% |
| o | 4565 | 6.6% |
| 4323 | 6.3% | |
| n | 4039 | 5.9% |
| l | 4015 | 5.8% |
| t | 3178 | 4.6% |
| s | 2952 | 4.3% |
| Other values (73) | 22331 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 53656 | |
| Uppercase Letter | 10171 | 14.8% |
| Space Separator | 4323 | 6.3% |
| Dash Punctuation | 310 | 0.5% |
| Decimal Number | 133 | 0.2% |
| Other Punctuation | 122 | 0.2% |
| Close Punctuation | 29 | < 0.1% |
| Open Punctuation | 29 | < 0.1% |
| Other Letter | 12 | < 0.1% |
| Control | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8247 | |
| e | 5329 | |
| r | 4927 | |
| i | 4885 | |
| o | 4565 | |
| n | 4039 | |
| l | 4015 | |
| t | 3178 | 5.9% |
| s | 2952 | 5.5% |
| u | 2486 | 4.6% |
| Other values (23) | 9033 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1187 | |
| C | 1125 | |
| N | 1117 | |
| F | 1032 | |
| W | 916 | 9.0% |
| P | 894 | 8.8% |
| A | 471 | 4.6% |
| I | 398 | 3.9% |
| H | 349 | 3.4% |
| Q | 323 | 3.2% |
| Other values (16) | 2359 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 38 | |
| 3 | 17 | |
| 2 | 17 | |
| 8 | 15 | 11.3% |
| 1 | 14 | 10.5% |
| 5 | 12 | 9.0% |
| 4 | 7 | 5.3% |
| 6 | 5 | 3.8% |
| 9 | 4 | 3.0% |
| 7 | 4 | 3.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 51 | |
| ' | 24 | |
| . | 21 | |
| & | 19 | 15.6% |
| " | 2 | 1.6% |
| / | 2 | 1.6% |
| ? | 2 | 1.6% |
| : | 1 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 4323 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 310 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 29 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 29 |
Other Letter
| Value | Count | Frequency (%) |
| º | 12 |
Control
| Value | Count | Frequency (%) |
| Â’ | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 63839 | |
| Common | 4952 | 7.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8247 | |
| e | 5329 | 8.3% |
| r | 4927 | 7.7% |
| i | 4885 | 7.7% |
| o | 4565 | 7.2% |
| n | 4039 | 6.3% |
| l | 4015 | 6.3% |
| t | 3178 | 5.0% |
| s | 2952 | 4.6% |
| u | 2486 | 3.9% |
| Other values (50) | 19216 |
Common
| Value | Count | Frequency (%) |
| 4323 | ||
| - | 310 | 6.3% |
| , | 51 | 1.0% |
| 0 | 38 | 0.8% |
| ) | 29 | 0.6% |
| ( | 29 | 0.6% |
| ' | 24 | 0.5% |
| . | 21 | 0.4% |
| & | 19 | 0.4% |
| 3 | 17 | 0.3% |
| Other values (13) | 91 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68743 | |
| None | 48 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8247 | 12.0% |
| e | 5329 | 7.8% |
| r | 4927 | 7.2% |
| i | 4885 | 7.1% |
| o | 4565 | 6.6% |
| 4323 | 6.3% | |
| n | 4039 | 5.9% |
| l | 4015 | 5.8% |
| t | 3178 | 4.6% |
| s | 2952 | 4.3% |
| Other values (63) | 22283 |
None
| Value | Count | Frequency (%) |
| º | 12 | |
| á | 6 | |
| é | 6 | |
| Â’ | 6 | |
| ó | 5 | |
| ã | 5 | |
| ô | 3 | 6.2% |
| î | 2 | 4.2% |
| É | 2 | 4.2% |
| ò | 1 | 2.1% |
Location
Text
MISSING 
| Distinct | 3984 |
|---|---|
| Distinct (%) | 71.4% |
| Missing | 512 |
| Missing (%) | 8.4% |
| Memory size | 458.4 KiB |
Length
| Max length | 119 |
|---|---|
| Median length | 79 |
| Mean length | 22.894859 |
| Min length | 3 |
Characters and Unicode
| Total characters | 127822 |
|---|---|
| Distinct characters | 99 |
| Distinct categories | 11 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 3309 ? |
|---|---|
| Unique (%) | 59.3% |
Sample
| 1st row | Point Casuarina, Bunbury |
|---|---|
| 2nd row | Flinders, Mornington Penisula |
| 3rd row | Ponce Inlet, Volusia County |
| 4th row | Bantham Beach |
| 5th row | Middle Sambo Reef off Boca Chica, Monroe County |
| Value | Count | Frequency (%) |
| beach | 1512 | 7.6% |
| county | 1436 | 7.2% |
| island | 599 | 3.0% |
| bay | 489 | 2.5% |
| of | 333 | 1.7% |
| volusia | 305 | 1.5% |
| off | 304 | 1.5% |
| river | 256 | 1.3% |
| near | 255 | 1.3% |
| new | 245 | 1.2% |
| Other values (3894) | 14134 |
Most occurring characters
| Value | Count | Frequency (%) |
| 14629 | 11.4% | |
| a | 12755 | 10.0% |
| e | 9554 | 7.5% |
| o | 8042 | 6.3% |
| n | 8001 | 6.3% |
| r | 6120 | 4.8% |
| t | 5926 | 4.6% |
| i | 5135 | 4.0% |
| l | 4906 | 3.8% |
| u | 4397 | 3.4% |
| Other values (89) | 48357 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 90809 | |
| Uppercase Letter | 17522 | 13.7% |
| Space Separator | 14631 | 11.4% |
| Other Punctuation | 3738 | 2.9% |
| Decimal Number | 707 | 0.6% |
| Dash Punctuation | 141 | 0.1% |
| Open Punctuation | 93 | 0.1% |
| Close Punctuation | 93 | 0.1% |
| Control | 79 | 0.1% |
| Other Letter | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12755 | |
| e | 9554 | |
| o | 8042 | 8.9% |
| n | 8001 | 8.8% |
| r | 6120 | 6.7% |
| t | 5926 | 6.5% |
| i | 5135 | 5.7% |
| l | 4906 | 5.4% |
| u | 4397 | 4.8% |
| s | 4226 | 4.7% |
| Other values (30) | 21747 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 2922 | |
| C | 2478 | |
| S | 1655 | 9.4% |
| P | 1344 | 7.7% |
| M | 1127 | 6.4% |
| I | 931 | 5.3% |
| R | 736 | 4.2% |
| N | 712 | 4.1% |
| H | 677 | 3.9% |
| L | 535 | 3.1% |
| Other values (18) | 4405 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 198 | |
| 1 | 114 | |
| 2 | 88 | |
| 5 | 84 | |
| 3 | 61 | 8.6% |
| 4 | 44 | 6.2% |
| 6 | 38 | 5.4% |
| 7 | 33 | 4.7% |
| 8 | 30 | 4.2% |
| 9 | 17 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3158 | |
| ' | 292 | 7.8% |
| . | 177 | 4.7% |
| & | 52 | 1.4% |
| / | 29 | 0.8% |
| ? | 19 | 0.5% |
| " | 10 | 0.3% |
| : | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| Â’ | 73 | |
| ‘ | 2 | 2.5% |
| ” | 1 | 1.3% |
| “ | 1 | 1.3% |
| š | 1 | 1.3% |
| – | 1 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| 14629 | ||
| Â | 2 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 141 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 93 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 93 |
Other Letter
| Value | Count | Frequency (%) |
| º | 8 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 108339 | |
| Common | 19483 | 15.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 12755 | 11.8% |
| e | 9554 | 8.8% |
| o | 8042 | 7.4% |
| n | 8001 | 7.4% |
| r | 6120 | 5.6% |
| t | 5926 | 5.5% |
| i | 5135 | 4.7% |
| l | 4906 | 4.5% |
| u | 4397 | 4.1% |
| s | 4226 | 3.9% |
| Other values (59) | 39277 |
Common
| Value | Count | Frequency (%) |
| 14629 | ||
| , | 3158 | 16.2% |
| ' | 292 | 1.5% |
| 0 | 198 | 1.0% |
| . | 177 | 0.9% |
| - | 141 | 0.7% |
| 1 | 114 | 0.6% |
| ( | 93 | 0.5% |
| ) | 93 | 0.5% |
| 2 | 88 | 0.5% |
| Other values (20) | 500 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 127672 | |
| None | 150 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 14629 | 11.5% | |
| a | 12755 | 10.0% |
| e | 9554 | 7.5% |
| o | 8042 | 6.3% |
| n | 8001 | 6.3% |
| r | 6120 | 4.8% |
| t | 5926 | 4.6% |
| i | 5135 | 4.0% |
| l | 4906 | 3.8% |
| u | 4397 | 3.4% |
| Other values (64) | 48207 |
None
| Value | Count | Frequency (%) |
| Â’ | 73 | |
| é | 17 | 11.3% |
| º | 8 | 5.3% |
| ã | 7 | 4.7% |
| á | 7 | 4.7% |
| ñ | 4 | 2.7% |
| è | 4 | 2.7% |
| ó | 4 | 2.7% |
| ú | 3 | 2.0% |
| ÃŽ | 3 | 2.0% |
| Other values (15) | 20 | 13.3% |
Activity
Text
MISSING 
| Distinct | 1503 |
|---|---|
| Distinct (%) | 27.0% |
| Missing | 536 |
| Missing (%) | 8.8% |
| Memory size | 418.3 KiB |
Length
| Max length | 255 |
|---|---|
| Median length | 242 |
| Mean length | 16.647958 |
| Min length | 1 |
Characters and Unicode
| Total characters | 92546 |
|---|---|
| Distinct characters | 80 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1289 ? |
|---|---|
| Unique (%) | 23.2% |
Sample
| 1st row | Body boarding |
|---|---|
| 2nd row | Surfing |
| 3rd row | Surfing |
| 4th row | Surfing |
| 5th row | Spearfishing |
| Value | Count | Frequency (%) |
| swimming | 1077 | 7.5% |
| surfing | 1060 | 7.4% |
| fishing | 713 | 5.0% |
| diving | 539 | 3.8% |
| spearfishing | 423 | 3.0% |
| the | 352 | 2.5% |
| 267 | 1.9% | |
| in | 245 | 1.7% |
| a | 236 | 1.7% |
| for | 208 | 1.5% |
| Other values (1979) | 9159 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 10695 | 11.6% |
| 8982 | 9.7% | |
| n | 8045 | 8.7% |
| g | 6069 | 6.6% |
| r | 5346 | 5.8% |
| a | 5294 | 5.7% |
| e | 5257 | 5.7% |
| s | 3805 | 4.1% |
| o | 3640 | 3.9% |
| t | 3363 | 3.6% |
| Other values (70) | 32050 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 75206 | |
| Space Separator | 8982 | 9.7% |
| Uppercase Letter | 6549 | 7.1% |
| Other Punctuation | 853 | 0.9% |
| Decimal Number | 584 | 0.6% |
| Dash Punctuation | 170 | 0.2% |
| Close Punctuation | 93 | 0.1% |
| Open Punctuation | 93 | 0.1% |
| Control | 16 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 10695 | |
| n | 8045 | 10.7% |
| g | 6069 | 8.1% |
| r | 5346 | 7.1% |
| a | 5294 | 7.0% |
| e | 5257 | 7.0% |
| s | 3805 | 5.1% |
| o | 3640 | 4.8% |
| t | 3363 | 4.5% |
| h | 3252 | 4.3% |
| Other values (18) | 20440 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3177 | |
| F | 884 | 13.5% |
| B | 478 | 7.3% |
| D | 330 | 5.0% |
| W | 304 | 4.6% |
| A | 174 | 2.7% |
| P | 163 | 2.5% |
| C | 154 | 2.4% |
| T | 149 | 2.3% |
| H | 93 | 1.4% |
| Other values (15) | 643 | 9.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 101 | |
| 2 | 85 | |
| 0 | 72 | |
| 4 | 65 | |
| 3 | 62 | |
| 5 | 57 | |
| 9 | 40 | 6.8% |
| 7 | 39 | 6.7% |
| 6 | 35 | 6.0% |
| 8 | 28 | 4.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 338 | |
| & | 178 | |
| . | 138 | |
| / | 100 | 11.7% |
| ' | 56 | 6.6% |
| " | 28 | 3.3% |
| ? | 9 | 1.1% |
| : | 4 | 0.5% |
| ; | 2 | 0.2% |
Control
| Value | Count | Frequency (%) |
| Â’ | 12 | |
| “ | 2 | 12.5% |
| – | 1 | 6.2% |
| ” | 1 | 6.2% |
Space Separator
| Value | Count | Frequency (%) |
| 8982 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 170 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 93 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 93 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 81755 | |
| Common | 10791 | 11.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 10695 | 13.1% |
| n | 8045 | 9.8% |
| g | 6069 | 7.4% |
| r | 5346 | 6.5% |
| a | 5294 | 6.5% |
| e | 5257 | 6.4% |
| s | 3805 | 4.7% |
| o | 3640 | 4.5% |
| t | 3363 | 4.1% |
| h | 3252 | 4.0% |
| Other values (43) | 26989 |
Common
| Value | Count | Frequency (%) |
| 8982 | ||
| , | 338 | 3.1% |
| & | 178 | 1.6% |
| - | 170 | 1.6% |
| . | 138 | 1.3% |
| 1 | 101 | 0.9% |
| / | 100 | 0.9% |
| ) | 93 | 0.9% |
| ( | 93 | 0.9% |
| 2 | 85 | 0.8% |
| Other values (17) | 513 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 92528 | |
| None | 18 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 10695 | 11.6% |
| 8982 | 9.7% | |
| n | 8045 | 8.7% |
| g | 6069 | 6.6% |
| r | 5346 | 5.8% |
| a | 5294 | 5.7% |
| e | 5257 | 5.7% |
| s | 3805 | 4.1% |
| o | 3640 | 3.9% |
| t | 3363 | 3.6% |
| Other values (64) | 32032 |
None
| Value | Count | Frequency (%) |
| Â’ | 12 | |
| “ | 2 | 11.1% |
| – | 1 | 5.6% |
| ê | 1 | 5.6% |
| Ã | 1 | 5.6% |
| ” | 1 | 5.6% |
Name
Text
MISSING 
| Distinct | 5086 |
|---|---|
| Distinct (%) | 86.4% |
| Missing | 207 |
| Missing (%) | 3.4% |
| Memory size | 424.8 KiB |
Length
| Max length | 222 |
|---|---|
| Median length | 111 |
| Mean length | 15.120924 |
| Min length | 1 |
Characters and Unicode
| Total characters | 89032 |
|---|---|
| Distinct characters | 100 |
| Distinct categories | 11 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 4998 ? |
|---|---|
| Unique (%) | 84.9% |
Sample
| 1st row | Paul Goff |
|---|---|
| 2nd row | female |
| 3rd row | Bryan Brock |
| 4th row | Rich Thomson |
| 5th row | Parker Simpson |
| Value | Count | Frequency (%) |
| male | 604 | 4.1% |
| a | 298 | 2.0% |
| 232 | 1.6% | |
| boat | 174 | 1.2% |
| john | 162 | 1.1% |
| occupants | 153 | 1.0% |
| female | 112 | 0.8% |
| the | 95 | 0.6% |
| william | 92 | 0.6% |
| james | 86 | 0.6% |
| Other values (6013) | 12618 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9205 | 10.3% | |
| a | 8210 | 9.2% |
| e | 7850 | 8.8% |
| n | 5604 | 6.3% |
| r | 5579 | 6.3% |
| o | 5118 | 5.7% |
| i | 4603 | 5.2% |
| l | 4338 | 4.9% |
| s | 3375 | 3.8% |
| t | 3275 | 3.7% |
| Other values (90) | 31875 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 65805 | |
| Uppercase Letter | 11391 | 12.8% |
| Space Separator | 9207 | 10.3% |
| Other Punctuation | 1906 | 2.1% |
| Decimal Number | 441 | 0.5% |
| Dash Punctuation | 110 | 0.1% |
| Open Punctuation | 68 | 0.1% |
| Close Punctuation | 68 | 0.1% |
| Control | 27 | < 0.1% |
| Connector Punctuation | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8210 | |
| e | 7850 | |
| n | 5604 | 8.5% |
| r | 5579 | 8.5% |
| o | 5118 | 7.8% |
| i | 4603 | 7.0% |
| l | 4338 | 6.6% |
| s | 3375 | 5.1% |
| t | 3275 | 5.0% |
| m | 2360 | 3.6% |
| Other values (29) | 15493 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1035 | 9.1% |
| J | 928 | 8.1% |
| S | 906 | 8.0% |
| C | 800 | 7.0% |
| B | 726 | 6.4% |
| A | 698 | 6.1% |
| R | 673 | 5.9% |
| D | 599 | 5.3% |
| H | 543 | 4.8% |
| G | 523 | 4.6% |
| Other values (17) | 3960 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 861 | |
| , | 480 | |
| & | 215 | 11.3% |
| : | 187 | 9.8% |
| ' | 98 | 5.1% |
| " | 49 | 2.6% |
| ; | 10 | 0.5% |
| / | 2 | 0.1% |
| ? | 2 | 0.1% |
| # | 1 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 119 | |
| 1 | 83 | |
| 4 | 53 | |
| 5 | 48 | |
| 3 | 35 | 7.9% |
| 6 | 29 | 6.6% |
| 0 | 25 | 5.7% |
| 8 | 20 | 4.5% |
| 7 | 17 | 3.9% |
| 9 | 12 | 2.7% |
Control
| Value | Count | Frequency (%) |
| Â’ | 15 | |
| 4 | 14.8% | |
| ” | 3 | 11.1% |
| “ | 3 | 11.1% |
| ‘ | 1 | 3.7% |
| Â… | 1 | 3.7% |
Space Separator
| Value | Count | Frequency (%) |
| 9205 | ||
| Â | 2 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 110 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 68 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 68 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 77196 | |
| Common | 11836 | 13.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8210 | 10.6% |
| e | 7850 | 10.2% |
| n | 5604 | 7.3% |
| r | 5579 | 7.2% |
| o | 5118 | 6.6% |
| i | 4603 | 6.0% |
| l | 4338 | 5.6% |
| s | 3375 | 4.4% |
| t | 3275 | 4.2% |
| m | 2360 | 3.1% |
| Other values (56) | 26884 |
Common
| Value | Count | Frequency (%) |
| 9205 | ||
| . | 861 | 7.3% |
| , | 480 | 4.1% |
| & | 215 | 1.8% |
| : | 187 | 1.6% |
| 2 | 119 | 1.0% |
| - | 110 | 0.9% |
| ' | 98 | 0.8% |
| 1 | 83 | 0.7% |
| ( | 68 | 0.6% |
| Other values (24) | 410 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 88936 | |
| None | 96 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9205 | 10.4% | |
| a | 8210 | 9.2% |
| e | 7850 | 8.8% |
| n | 5604 | 6.3% |
| r | 5579 | 6.3% |
| o | 5118 | 5.8% |
| i | 4603 | 5.2% |
| l | 4338 | 4.9% |
| s | 3375 | 3.8% |
| t | 3275 | 3.7% |
| Other values (70) | 31779 |
None
| Value | Count | Frequency (%) |
| é | 32 | |
| Â’ | 15 | |
| á | 8 | 8.3% |
| ã | 5 | 5.2% |
| Ã | 5 | 5.2% |
| ó | 4 | 4.2% |
| ú | 4 | 4.2% |
| ” | 3 | 3.1% |
| “ | 3 | 3.1% |
| Â | 2 | 2.1% |
| Other values (10) | 15 |
Sex
Categorical
IMBALANCE  MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 578 |
| Missing (%) | 9.5% |
| Memory size | 335.2 KiB |
| M | |
|---|---|
| F | |
| M | 2 |
| lli | 1 |
| N | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 1 |
| Mean length | 1.000725 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5521 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | M |
|---|---|
| 2nd row | F |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 4906 | |
| F | 606 | 9.9% |
| M | 2 | < 0.1% |
| lli | 1 | < 0.1% |
| N | 1 | < 0.1% |
| . | 1 | < 0.1% |
| (Missing) | 578 | 9.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| m | 4908 | |
| f | 606 | 11.0% |
| lli | 1 | < 0.1% |
| n | 1 | < 0.1% |
| 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 4908 | |
| F | 606 | 11.0% |
| 2 | < 0.1% | |
| l | 2 | < 0.1% |
| i | 1 | < 0.1% |
| N | 1 | < 0.1% |
| . | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5515 | |
| Lowercase Letter | 3 | 0.1% |
| Space Separator | 2 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 4908 | |
| F | 606 | 11.0% |
| N | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 2 | |
| i | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5518 | |
| Common | 3 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 4908 | |
| F | 606 | 11.0% |
| l | 2 | < 0.1% |
| i | 1 | < 0.1% |
| N | 1 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 2 | ||
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5521 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 4908 | |
| F | 606 | 11.0% |
| 2 | < 0.1% | |
| l | 2 | < 0.1% |
| i | 1 | < 0.1% |
| N | 1 | < 0.1% |
| . | 1 | < 0.1% |
Age
Text
MISSING 
| Distinct | 151 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 2721 |
| Missing (%) | 44.6% |
| Memory size | 279.9 KiB |
Length
| Max length | 23 |
|---|---|
| Median length | 2 |
| Mean length | 2.0815056 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7023 |
|---|---|
| Distinct characters | 52 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 68 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | 48 |
|---|---|
| 2nd row | 19 |
| 3rd row | 30 |
| 4th row | 32 |
| 5th row | 20 |
| Value | Count | Frequency (%) |
| 17 | 154 | 4.4% |
| 18 | 151 | 4.4% |
| 19 | 142 | 4.1% |
| 20 | 142 | 4.1% |
| 16 | 138 | 4.0% |
| 15 | 135 | 3.9% |
| 21 | 120 | 3.5% |
| 22 | 115 | 3.3% |
| 24 | 104 | 3.0% |
| 25 | 104 | 3.0% |
| Other values (100) | 2160 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1362 | |
| 2 | 1342 | |
| 3 | 846 | |
| 4 | 644 | |
| 5 | 566 | |
| 0 | 407 | 5.8% |
| 6 | 403 | 5.7% |
| 7 | 371 | 5.3% |
| 8 | 366 | 5.2% |
| 9 | 341 | 4.9% |
| Other values (42) | 375 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6648 | |
| Lowercase Letter | 178 | 2.5% |
| Space Separator | 115 | 1.6% |
| Other Punctuation | 42 | 0.6% |
| Uppercase Letter | 32 | 0.5% |
| Dash Punctuation | 3 | < 0.1% |
| Other Number | 2 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 33 | |
| o | 26 | |
| s | 25 | |
| n | 21 | |
| r | 16 | |
| t | 15 | |
| d | 7 | 3.9% |
| m | 6 | 3.4% |
| l | 5 | 2.8% |
| u | 5 | 2.8% |
| Other values (5) | 19 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 10 | |
| E | 5 | |
| M | 3 | 9.4% |
| F | 2 | 6.2% |
| A | 2 | 6.2% |
| N | 2 | 6.2% |
| C | 1 | 3.1% |
| K | 1 | 3.1% |
| B | 1 | 3.1% |
| X | 1 | 3.1% |
| Other values (4) | 4 | 12.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1362 | |
| 2 | 1342 | |
| 3 | 846 | |
| 4 | 644 | |
| 5 | 566 | |
| 0 | 407 | 6.1% |
| 6 | 403 | 6.1% |
| 7 | 371 | 5.6% |
| 8 | 366 | 5.5% |
| 9 | 341 | 5.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 22 | |
| , | 7 | 16.7% |
| ? | 5 | 11.9% |
| " | 4 | 9.5% |
| . | 3 | 7.1% |
| ' | 1 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
| 114 | ||
| Â | 1 | 0.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6813 | |
| Latin | 210 | 3.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 33 | |
| o | 26 | |
| s | 25 | |
| n | 21 | |
| r | 16 | 7.6% |
| t | 15 | 7.1% |
| T | 10 | 4.8% |
| d | 7 | 3.3% |
| m | 6 | 2.9% |
| l | 5 | 2.4% |
| Other values (19) | 46 |
Common
| Value | Count | Frequency (%) |
| 1 | 1362 | |
| 2 | 1342 | |
| 3 | 846 | |
| 4 | 644 | |
| 5 | 566 | |
| 0 | 407 | 6.0% |
| 6 | 403 | 5.9% |
| 7 | 371 | 5.4% |
| 8 | 366 | 5.4% |
| 9 | 341 | 5.0% |
| Other values (13) | 165 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7020 | |
| None | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1362 | |
| 2 | 1342 | |
| 3 | 846 | |
| 4 | 644 | |
| 5 | 566 | |
| 0 | 407 | 5.8% |
| 6 | 403 | 5.7% |
| 7 | 371 | 5.3% |
| 8 | 366 | 5.2% |
| 9 | 341 | 4.9% |
| Other values (40) | 372 | 5.3% |
None
| Value | Count | Frequency (%) |
| ½ | 2 | |
| Â | 1 |
Injury
Text
| Distinct | 3645 |
|---|---|
| Distinct (%) | 60.1% |
| Missing | 29 |
| Missing (%) | 0.5% |
| Memory size | 531.4 KiB |
Length
| Max length | 235 |
|---|---|
| Median length | 152 |
| Mean length | 31.925816 |
| Min length | 5 |
Characters and Unicode
| Total characters | 193662 |
|---|---|
| Distinct characters | 81 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 3267 ? |
|---|---|
| Unique (%) | 53.9% |
Sample
| 1st row | No injury, board bitten |
|---|---|
| 2nd row | No injury, knocke off board |
| 3rd row | Laceration to left foot |
| 4th row | Bruise to leg, cuts to hand sustained when he hit the shark |
| 5th row | Laceration to shin |
| Value | Count | Frequency (%) |
| bitten | 1539 | 4.7% |
| to | 1524 | 4.6% |
| fatal | 1296 | 3.9% |
| shark | 1227 | 3.7% |
| 1024 | 3.1% | |
| injury | 915 | 2.8% |
| no | 871 | 2.6% |
| leg | 863 | 2.6% |
| right | 829 | 2.5% |
| left | 820 | 2.5% |
| Other values (1940) | 22066 |
Most occurring characters
| Value | Count | Frequency (%) |
| 28050 | ||
| e | 16361 | 8.4% |
| t | 14291 | 7.4% |
| a | 11101 | 5.7% |
| r | 11100 | 5.7% |
| o | 11008 | 5.7% |
| i | 9375 | 4.8% |
| n | 9297 | 4.8% |
| s | 7112 | 3.7% |
| h | 6350 | 3.3% |
| Other values (71) | 69617 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 140004 | |
| Space Separator | 28050 | 14.5% |
| Uppercase Letter | 20532 | 10.6% |
| Other Punctuation | 3856 | 2.0% |
| Decimal Number | 953 | 0.5% |
| Dash Punctuation | 127 | 0.1% |
| Open Punctuation | 49 | < 0.1% |
| Close Punctuation | 49 | < 0.1% |
| Control | 38 | < 0.1% |
| Math Symbol | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 16361 | |
| t | 14291 | 10.2% |
| a | 11101 | 7.9% |
| r | 11100 | 7.9% |
| o | 11008 | 7.9% |
| i | 9375 | 6.7% |
| n | 9297 | 6.6% |
| s | 7112 | 5.1% |
| h | 6350 | 4.5% |
| d | 5787 | 4.1% |
| Other values (17) | 38222 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2820 | |
| L | 2512 | |
| T | 2041 | |
| N | 1963 | |
| F | 1566 | 7.6% |
| I | 1220 | 5.9% |
| D | 1209 | 5.9% |
| O | 1154 | 5.6% |
| E | 1137 | 5.5% |
| R | 950 | 4.6% |
| Other values (14) | 3960 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 174 | |
| 2 | 172 | |
| 3 | 145 | |
| 0 | 105 | |
| 5 | 91 | |
| 4 | 86 | |
| 6 | 57 | 6.0% |
| 9 | 47 | 4.9% |
| 8 | 42 | 4.4% |
| 7 | 34 | 3.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2016 | |
| & | 994 | |
| . | 373 | 9.7% |
| " | 170 | 4.4% |
| ' | 152 | 3.9% |
| ; | 67 | 1.7% |
| / | 56 | 1.5% |
| : | 20 | 0.5% |
| ? | 8 | 0.2% |
Control
| Value | Count | Frequency (%) |
| Â’ | 24 | |
| “ | 7 | 18.4% |
| ” | 7 | 18.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 33 | |
| [ | 16 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 33 | |
| ] | 16 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3 | |
| > | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| 28050 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 127 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 160536 | |
| Common | 33126 | 17.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 16361 | 10.2% |
| t | 14291 | 8.9% |
| a | 11101 | 6.9% |
| r | 11100 | 6.9% |
| o | 11008 | 6.9% |
| i | 9375 | 5.8% |
| n | 9297 | 5.8% |
| s | 7112 | 4.4% |
| h | 6350 | 4.0% |
| d | 5787 | 3.6% |
| Other values (41) | 58754 |
Common
| Value | Count | Frequency (%) |
| 28050 | ||
| , | 2016 | 6.1% |
| & | 994 | 3.0% |
| . | 373 | 1.1% |
| 1 | 174 | 0.5% |
| 2 | 172 | 0.5% |
| " | 170 | 0.5% |
| ' | 152 | 0.5% |
| 3 | 145 | 0.4% |
| - | 127 | 0.4% |
| Other values (20) | 753 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 193623 | |
| None | 39 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 28050 | ||
| e | 16361 | 8.4% |
| t | 14291 | 7.4% |
| a | 11101 | 5.7% |
| r | 11100 | 5.7% |
| o | 11008 | 5.7% |
| i | 9375 | 4.8% |
| n | 9297 | 4.8% |
| s | 7112 | 3.7% |
| h | 6350 | 3.3% |
| Other values (67) | 69578 |
None
| Value | Count | Frequency (%) |
| Â’ | 24 | |
| “ | 7 | 17.9% |
| ” | 7 | 17.9% |
| ê | 1 | 2.6% |
Fatal (Y/N)
Categorical
IMBALANCE 
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 31 |
| Missing (%) | 0.5% |
| Memory size | 345.4 KiB |
| N | |
|---|---|
| Y | |
| UNKNOWN | 94 |
| N | 8 |
| 2017 | 1 |
| Other values (4) | 4 |
Length
| Max length | 7 |
|---|---|
| Median length | 1 |
| Mean length | 1.0959763 |
| Min length | 1 |
Characters and Unicode
| Total characters | 6646 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | N |
|---|---|
| 2nd row | N |
| 3rd row | N |
| 4th row | N |
| 5th row | N |
Common Values
| Value | Count | Frequency (%) |
| N | 4391 | |
| Y | 1566 | 25.7% |
| UNKNOWN | 94 | 1.5% |
| N | 8 | 0.1% |
| 2017 | 1 | < 0.1% |
| F | 1 | < 0.1% |
| N | 1 | < 0.1% |
| #VALUE! | 1 | < 0.1% |
| n | 1 | < 0.1% |
| (Missing) | 31 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| n | 4401 | |
| y | 1566 | 25.8% |
| unknown | 94 | 1.6% |
| 2017 | 1 | < 0.1% |
| f | 1 | < 0.1% |
| value | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 4682 | |
| Y | 1566 | 23.6% |
| U | 95 | 1.4% |
| K | 94 | 1.4% |
| O | 94 | 1.4% |
| W | 94 | 1.4% |
| 9 | 0.1% | |
| V | 1 | < 0.1% |
| ! | 1 | < 0.1% |
| E | 1 | < 0.1% |
| Other values (9) | 9 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6630 | |
| Space Separator | 9 | 0.1% |
| Decimal Number | 4 | 0.1% |
| Other Punctuation | 2 | < 0.1% |
| Lowercase Letter | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 4682 | |
| Y | 1566 | 23.6% |
| U | 95 | 1.4% |
| K | 94 | 1.4% |
| O | 94 | 1.4% |
| W | 94 | 1.4% |
| V | 1 | < 0.1% |
| E | 1 | < 0.1% |
| L | 1 | < 0.1% |
| A | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 7 | 1 | |
| 0 | 1 | |
| 2 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| ! | 1 | |
| # | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 9 |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6631 | |
| Common | 15 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 4682 | |
| Y | 1566 | 23.6% |
| U | 95 | 1.4% |
| K | 94 | 1.4% |
| O | 94 | 1.4% |
| W | 94 | 1.4% |
| V | 1 | < 0.1% |
| E | 1 | < 0.1% |
| L | 1 | < 0.1% |
| A | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 9 | ||
| ! | 1 | 6.7% |
| 1 | 1 | 6.7% |
| # | 1 | 6.7% |
| 7 | 1 | 6.7% |
| 0 | 1 | 6.7% |
| 2 | 1 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6646 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 4682 | |
| Y | 1566 | 23.6% |
| U | 95 | 1.4% |
| K | 94 | 1.4% |
| O | 94 | 1.4% |
| W | 94 | 1.4% |
| 9 | 0.1% | |
| V | 1 | < 0.1% |
| ! | 1 | < 0.1% |
| E | 1 | < 0.1% |
| Other values (9) | 9 | 0.1% |
Time
Text
MISSING 
| Distinct | 360 |
|---|---|
| Distinct (%) | 12.6% |
| Missing | 3247 |
| Missing (%) | 53.3% |
| Memory size | 276.2 KiB |
Length
| Max length | 69 |
|---|---|
| Median length | 5 |
| Mean length | 5.7865169 |
| Min length | 1 |
Characters and Unicode
| Total characters | 16480 |
|---|---|
| Distinct characters | 61 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 197 ? |
|---|---|
| Unique (%) | 6.9% |
Sample
| 1st row | 08h30 |
|---|---|
| 2nd row | 15h45 |
| 3rd row | 10h00 |
| 4th row | Shortly before 12h00 |
| 5th row | Morning |
| Value | Count | Frequency (%) |
| afternoon | 223 | 7.4% |
| 11h00 | 131 | 4.3% |
| morning | 127 | 4.2% |
| 12h00 | 113 | 3.7% |
| 15h00 | 103 | 3.4% |
| 14h00 | 99 | 3.3% |
| 16h00 | 98 | 3.2% |
| 16h30 | 74 | 2.4% |
| 14h30 | 74 | 2.4% |
| 13h00 | 73 | 2.4% |
| Other values (310) | 1912 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3522 | |
| h | 2402 | |
| 1 | 2367 | |
| 3 | 918 | 5.6% |
| n | 826 | 5.0% |
| 5 | 700 | 4.2% |
| o | 620 | 3.8% |
| 4 | 470 | 2.9% |
| r | 419 | 2.5% |
| t | 382 | 2.3% |
| Other values (51) | 3854 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9336 | |
| Lowercase Letter | 6238 | |
| Uppercase Letter | 586 | 3.6% |
| Space Separator | 198 | 1.2% |
| Other Punctuation | 82 | 0.5% |
| Dash Punctuation | 27 | 0.2% |
| Math Symbol | 7 | < 0.1% |
| Close Punctuation | 3 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| h | 2402 | |
| n | 826 | 13.2% |
| o | 620 | 9.9% |
| r | 419 | 6.7% |
| t | 382 | 6.1% |
| e | 373 | 6.0% |
| i | 270 | 4.3% |
| f | 252 | 4.0% |
| g | 234 | 3.8% |
| a | 140 | 2.2% |
| Other values (13) | 320 | 5.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 206 | |
| M | 158 | |
| N | 62 | 10.6% |
| E | 55 | 9.4% |
| L | 39 | 6.7% |
| D | 22 | 3.8% |
| P | 15 | 2.6% |
| S | 12 | 2.0% |
| B | 6 | 1.0% |
| J | 6 | 1.0% |
| Other values (5) | 5 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3522 | |
| 1 | 2367 | |
| 3 | 918 | 9.8% |
| 5 | 700 | 7.5% |
| 4 | 470 | 5.0% |
| 2 | 375 | 4.0% |
| 7 | 287 | 3.1% |
| 6 | 287 | 3.1% |
| 8 | 230 | 2.5% |
| 9 | 180 | 1.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 55 | |
| " | 14 | 17.1% |
| / | 8 | 9.8% |
| & | 3 | 3.7% |
| ? | 1 | 1.2% |
| : | 1 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| 197 | ||
| Â | 1 | 0.5% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 6 | |
| < | 1 | 14.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 27 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9656 | |
| Latin | 6824 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| h | 2402 | |
| n | 826 | 12.1% |
| o | 620 | 9.1% |
| r | 419 | 6.1% |
| t | 382 | 5.6% |
| e | 373 | 5.5% |
| i | 270 | 4.0% |
| f | 252 | 3.7% |
| g | 234 | 3.4% |
| A | 206 | 3.0% |
| Other values (28) | 840 | 12.3% |
Common
| Value | Count | Frequency (%) |
| 0 | 3522 | |
| 1 | 2367 | |
| 3 | 918 | 9.5% |
| 5 | 700 | 7.2% |
| 4 | 470 | 4.9% |
| 2 | 375 | 3.9% |
| 7 | 287 | 3.0% |
| 6 | 287 | 3.0% |
| 8 | 230 | 2.4% |
| 197 | 2.0% | |
| Other values (13) | 303 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16479 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3522 | |
| h | 2402 | |
| 1 | 2367 | |
| 3 | 918 | 5.6% |
| n | 826 | 5.0% |
| 5 | 700 | 4.2% |
| o | 620 | 3.8% |
| 4 | 470 | 2.9% |
| r | 419 | 2.5% |
| t | 382 | 2.3% |
| Other values (50) | 3853 |
None
| Value | Count | Frequency (%) |
| Â | 1 |
Species
Text
MISSING 
| Distinct | 1554 |
|---|---|
| Distinct (%) | 50.1% |
| Missing | 2996 |
| Missing (%) | 49.2% |
| Memory size | 336.2 KiB |
Length
| Max length | 196 |
|---|---|
| Median length | 136 |
| Mean length | 22.668925 |
| Min length | 1 |
Characters and Unicode
| Total characters | 70251 |
|---|---|
| Distinct characters | 86 |
| Distinct categories | 11 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1226 ? |
|---|---|
| Unique (%) | 39.6% |
Sample
| 1st row | White shark, 4 m |
|---|---|
| 2nd row | 7 gill shark |
| 3rd row | 3m shark, probably a smooth hound |
| 4th row | 8' shark |
| 5th row | Tiger shark |
| Value | Count | Frequency (%) |
| shark | 3002 | |
| m | 1440 | 10.3% |
| to | 768 | 5.5% |
| white | 630 | 4.5% |
| 6 | 306 | 2.2% |
| 4 | 299 | 2.1% |
| 5 | 296 | 2.1% |
| 3 | 276 | 2.0% |
| tiger | 271 | 1.9% |
| a | 244 | 1.7% |
| Other values (862) | 6496 |
Most occurring characters
| Value | Count | Frequency (%) |
| 11558 | ||
| r | 4686 | 6.7% |
| a | 4633 | 6.6% |
| h | 4387 | 6.2% |
| s | 3969 | 5.6% |
| e | 3533 | 5.0% |
| k | 3430 | 4.9% |
| t | 2774 | 3.9% |
| o | 2420 | 3.4% |
| i | 2391 | 3.4% |
| Other values (76) | 26470 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 43518 | |
| Space Separator | 11561 | 16.5% |
| Decimal Number | 6231 | 8.9% |
| Other Punctuation | 4616 | 6.6% |
| Uppercase Letter | 2038 | 2.9% |
| Close Punctuation | 1028 | 1.5% |
| Open Punctuation | 1028 | 1.5% |
| Dash Punctuation | 177 | 0.3% |
| Math Symbol | 28 | < 0.1% |
| Control | 25 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 4686 | |
| a | 4633 | |
| h | 4387 | |
| s | 3969 | |
| e | 3533 | 8.1% |
| k | 3430 | 7.9% |
| t | 2774 | 6.4% |
| o | 2420 | 5.6% |
| i | 2391 | 5.5% |
| m | 2292 | 5.3% |
| Other values (17) | 9003 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 494 | |
| T | 308 | |
| B | 299 | |
| S | 256 | |
| N | 80 | 3.9% |
| G | 68 | 3.3% |
| R | 67 | 3.3% |
| P | 63 | 3.1% |
| C | 58 | 2.8% |
| M | 55 | 2.7% |
| Other values (15) | 290 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1952 | |
| . | 1327 | |
| , | 973 | |
| " | 259 | 5.6% |
| ? | 41 | 0.9% |
| & | 37 | 0.8% |
| ; | 12 | 0.3% |
| / | 8 | 0.2% |
| : | 6 | 0.1% |
| * | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1308 | |
| 5 | 966 | |
| 2 | 837 | |
| 4 | 655 | |
| 3 | 616 | |
| 6 | 538 | |
| 0 | 444 | 7.1% |
| 8 | 422 | 6.8% |
| 7 | 276 | 4.4% |
| 9 | 169 | 2.7% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 21 | |
| < | 4 | 14.3% |
| + | 3 | 10.7% |
Control
| Value | Count | Frequency (%) |
| ” | 12 | |
| “ | 10 | |
| Â’ | 3 | 12.0% |
Space Separator
| Value | Count | Frequency (%) |
| 11558 | ||
| Â | 3 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 1000 | |
| ) | 28 | 2.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 1000 | |
| ( | 28 | 2.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 177 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45556 | |
| Common | 24695 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 4686 | |
| a | 4633 | |
| h | 4387 | 9.6% |
| s | 3969 | 8.7% |
| e | 3533 | 7.8% |
| k | 3430 | 7.5% |
| t | 2774 | 6.1% |
| o | 2420 | 5.3% |
| i | 2391 | 5.2% |
| m | 2292 | 5.0% |
| Other values (42) | 11041 |
Common
| Value | Count | Frequency (%) |
| 11558 | ||
| ' | 1952 | 7.9% |
| . | 1327 | 5.4% |
| 1 | 1308 | 5.3% |
| ] | 1000 | 4.0% |
| [ | 1000 | 4.0% |
| , | 973 | 3.9% |
| 5 | 966 | 3.9% |
| 2 | 837 | 3.4% |
| 4 | 655 | 2.7% |
| Other values (24) | 3119 | 12.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 70221 | |
| None | 30 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 11558 | ||
| r | 4686 | 6.7% |
| a | 4633 | 6.6% |
| h | 4387 | 6.2% |
| s | 3969 | 5.7% |
| e | 3533 | 5.0% |
| k | 3430 | 4.9% |
| t | 2774 | 4.0% |
| o | 2420 | 3.4% |
| i | 2391 | 3.4% |
| Other values (70) | 26440 |
None
| Value | Count | Frequency (%) |
| ” | 12 | |
| “ | 10 | |
| Â | 3 | 10.0% |
| Â’ | 3 | 10.0% |
| ½ | 1 | 3.3% |
| ã | 1 | 3.3% |
| Distinct | 4831 |
|---|---|
| Distinct (%) | 79.5% |
| Missing | 18 |
| Missing (%) | 0.3% |
| Memory size | 535.0 KiB |
Length
| Max length | 210 |
|---|---|
| Median length | 139 |
| Mean length | 32.807142 |
| Min length | 3 |
Characters and Unicode
| Total characters | 199369 |
|---|---|
| Distinct characters | 92 |
| Distinct categories | 13 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 4497 ? |
|---|---|
| Unique (%) | 74.0% |
Sample
| 1st row | WA Today, 6/11/2017 |
|---|---|
| 2nd row | Daytona Beach News-Journal, 6/10/2017 |
| 3rd row | C. Moore, GSAF |
| 4th row | Nine News, 6/7/2017 |
| 5th row | Tribune 242, 6/2/2017 |
| Value | Count | Frequency (%) |
| gsaf | 1119 | 3.8% |
| 550 | 1.9% | |
| m | 523 | 1.8% |
| v.m | 518 | 1.7% |
| coppleson | 506 | 1.7% |
| r | 454 | 1.5% |
| c | 420 | 1.4% |
| j | 413 | 1.4% |
| a | 409 | 1.4% |
| the | 403 | 1.4% |
| Other values (6667) | 24333 |
Most occurring characters
| Value | Count | Frequency (%) |
| 24768 | 12.4% | |
| e | 11642 | 5.8% |
| . | 9485 | 4.8% |
| 1 | 8549 | 4.3% |
| , | 7584 | 3.8% |
| a | 7582 | 3.8% |
| r | 7057 | 3.5% |
| / | 6627 | 3.3% |
| n | 6624 | 3.3% |
| o | 6433 | 3.2% |
| Other values (82) | 103018 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 82948 | |
| Decimal Number | 33827 | |
| Uppercase Letter | 28404 | 14.2% |
| Other Punctuation | 26519 | 13.3% |
| Space Separator | 24776 | 12.4% |
| Close Punctuation | 1112 | 0.6% |
| Open Punctuation | 1110 | 0.6% |
| Dash Punctuation | 652 | 0.3% |
| Control | 10 | < 0.1% |
| Connector Punctuation | 6 | < 0.1% |
| Other values (3) | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11642 | |
| a | 7582 | |
| r | 7057 | 8.5% |
| n | 6624 | 8.0% |
| o | 6433 | 7.8% |
| l | 6146 | 7.4% |
| i | 6128 | 7.4% |
| s | 4881 | 5.9% |
| p | 4554 | 5.5% |
| t | 4123 | 5.0% |
| Other values (20) | 17778 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3228 | 11.4% |
| C | 2766 | 9.7% |
| A | 2740 | 9.6% |
| M | 2451 | 8.6% |
| G | 1983 | 7.0% |
| F | 1786 | 6.3% |
| T | 1491 | 5.2% |
| D | 1332 | 4.7% |
| B | 1267 | 4.5% |
| N | 1189 | 4.2% |
| Other values (16) | 8171 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 9485 | |
| , | 7584 | |
| / | 6627 | |
| ; | 2017 | 7.6% |
| & | 507 | 1.9% |
| # | 190 | 0.7% |
| : | 48 | 0.2% |
| ' | 43 | 0.2% |
| " | 14 | 0.1% |
| ? | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8549 | |
| 2 | 4850 | |
| 9 | 4748 | |
| 0 | 3313 | 9.8% |
| 8 | 2361 | 7.0% |
| 5 | 2227 | 6.6% |
| 3 | 2158 | 6.4% |
| 6 | 2093 | 6.2% |
| 4 | 1828 | 5.4% |
| 7 | 1700 | 5.0% |
Space Separator
| Value | Count | Frequency (%) |
| 24768 | ||
| Â | 8 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1104 | |
| [ | 6 | 0.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1101 | |
| ] | 11 | 1.0% |
Control
| Value | Count | Frequency (%) |
| 7 | ||
| Â’ | 3 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 2 | |
| + | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 652 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 6 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 111352 | |
| Common | 88017 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 11642 | 10.5% |
| a | 7582 | 6.8% |
| r | 7057 | 6.3% |
| n | 6624 | 5.9% |
| o | 6433 | 5.8% |
| l | 6146 | 5.5% |
| i | 6128 | 5.5% |
| s | 4881 | 4.4% |
| p | 4554 | 4.1% |
| t | 4123 | 3.7% |
| Other values (46) | 46182 |
Common
| Value | Count | Frequency (%) |
| 24768 | ||
| . | 9485 | 10.8% |
| 1 | 8549 | 9.7% |
| , | 7584 | 8.6% |
| / | 6627 | 7.5% |
| 2 | 4850 | 5.5% |
| 9 | 4748 | 5.4% |
| 0 | 3313 | 3.8% |
| 8 | 2361 | 2.7% |
| 5 | 2227 | 2.5% |
| Other values (26) | 13505 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 199342 | |
| None | 27 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 24768 | 12.4% | |
| e | 11642 | 5.8% |
| . | 9485 | 4.8% |
| 1 | 8549 | 4.3% |
| , | 7584 | 3.8% |
| a | 7582 | 3.8% |
| r | 7057 | 3.5% |
| / | 6627 | 3.3% |
| n | 6624 | 3.3% |
| o | 6433 | 3.2% |
| Other values (76) | 102991 |
None
| Value | Count | Frequency (%) |
| é | 13 | |
| Â | 8 | |
| Â’ | 3 | 11.1% |
| á | 1 | 3.7% |
| è | 1 | 3.7% |
| î | 1 | 3.7% |
pdf
Text
| Distinct | 6083 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 479.7 KiB |
Length
| Max length | 44 |
|---|---|
| Median length | 41 |
| Mean length | 23.580899 |
| Min length | 5 |
Characters and Unicode
| Total characters | 143702 |
|---|---|
| Distinct characters | 69 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6072 ? |
|---|---|
| Unique (%) | 99.6% |
Sample
| 1st row | 2017.06.11-Goff.pdf |
|---|---|
| 2nd row | 2017.06.10.b-Flinders.pdf |
| 3rd row | 2017.06.10.a-Brock.pdf |
| 4th row | 2017.06.07.R-Thomson.pdf |
| 5th row | 2017.06.04-Simpson.pdf |
| Value | Count | Frequency (%) |
| 19 | 0.3% | |
| 5 | 0.1% | |
| fisherman.pdf | 3 | < 0.1% |
| boat.pdf | 3 | < 0.1% |
| 1898.00.00.r-syria.pdf | 2 | < 0.1% |
| 1916.12.08.a-b-german.pdf | 2 | < 0.1% |
| beach.pdf | 2 | < 0.1% |
| bay.pdf | 2 | < 0.1% |
| harbor.pdf | 2 | < 0.1% |
| 1907.10.16.r-hongkong.pdf | 2 | < 0.1% |
| Other values (6134) | 6144 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 20055 | 14.0% |
| 0 | 12966 | 9.0% |
| 1 | 10259 | 7.1% |
| d | 7262 | 5.1% |
| - | 6939 | 4.8% |
| p | 6483 | 4.5% |
| f | 6437 | 4.5% |
| 2 | 5896 | 4.1% |
| 9 | 5807 | 4.0% |
| a | 5661 | 3.9% |
| Other values (59) | 55937 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 59486 | |
| Decimal Number | 48295 | |
| Other Punctuation | 20085 | 14.0% |
| Uppercase Letter | 8678 | 6.0% |
| Dash Punctuation | 6939 | 4.8% |
| Connector Punctuation | 121 | 0.1% |
| Space Separator | 98 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 7262 | |
| p | 6483 | |
| f | 6437 | |
| a | 5661 | |
| e | 4595 | 7.7% |
| r | 3498 | 5.9% |
| n | 3489 | 5.9% |
| o | 3263 | 5.5% |
| i | 3045 | 5.1% |
| l | 2569 | 4.3% |
| Other values (16) | 13184 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 861 | 9.9% |
| S | 782 | 9.0% |
| B | 749 | 8.6% |
| C | 654 | 7.5% |
| M | 632 | 7.3% |
| N | 518 | 6.0% |
| D | 467 | 5.4% |
| H | 423 | 4.9% |
| P | 394 | 4.5% |
| A | 352 | 4.1% |
| Other values (16) | 2846 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12966 | |
| 1 | 10259 | |
| 2 | 5896 | |
| 9 | 5807 | |
| 8 | 2708 | 5.6% |
| 6 | 2382 | 4.9% |
| 7 | 2156 | 4.5% |
| 5 | 2118 | 4.4% |
| 3 | 2067 | 4.3% |
| 4 | 1936 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 20055 | |
| ' | 25 | 0.1% |
| , | 3 | < 0.1% |
| & | 2 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6939 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 121 |
Space Separator
| Value | Count | Frequency (%) |
| 98 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 75538 | |
| Latin | 68164 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| d | 7262 | 10.7% |
| p | 6483 | 9.5% |
| f | 6437 | 9.4% |
| a | 5661 | 8.3% |
| e | 4595 | 6.7% |
| r | 3498 | 5.1% |
| n | 3489 | 5.1% |
| o | 3263 | 4.8% |
| i | 3045 | 4.5% |
| l | 2569 | 3.8% |
| Other values (42) | 21862 |
Common
| Value | Count | Frequency (%) |
| . | 20055 | |
| 0 | 12966 | |
| 1 | 10259 | |
| - | 6939 | 9.2% |
| 2 | 5896 | 7.8% |
| 9 | 5807 | 7.7% |
| 8 | 2708 | 3.6% |
| 6 | 2382 | 3.2% |
| 7 | 2156 | 2.9% |
| 5 | 2118 | 2.8% |
| Other values (7) | 4252 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 143702 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 20055 | 14.0% |
| 0 | 12966 | 9.0% |
| 1 | 10259 | 7.1% |
| d | 7262 | 5.1% |
| - | 6939 | 4.8% |
| p | 6483 | 4.5% |
| f | 6437 | 4.5% |
| 2 | 5896 | 4.1% |
| 9 | 5807 | 4.0% |
| a | 5661 | 3.9% |
| Other values (59) | 55937 |
href formula
Text
| Distinct | 6082 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 800.9 KiB |
Length
| Max length | 98 |
|---|---|
| Median length | 95 |
| Mean length | 77.563433 |
| Min length | 7 |
Characters and Unicode
| Total characters | 472594 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6071 ? |
|---|---|
| Unique (%) | 99.6% |
Sample
| 1st row | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.11-Goff.pdf |
|---|---|
| 2nd row | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.10.b-Flinders.pdf |
| 3rd row | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.10.a-Brock.pdf |
| 4th row | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.07.R-Thomson.pdf |
| 5th row | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.04-Simpson.pdf |
| Value | Count | Frequency (%) |
| 19 | 0.3% | |
| 5 | 0.1% | |
| fisherman.pdf | 3 | < 0.1% |
| boat.pdf | 3 | < 0.1% |
| http://sharkattackfile.net/spreadsheets/pdf_directory/1906.09.27.r.a&b-munich-swede.pdf | 2 | < 0.1% |
| http://sharkattackfile.net/spreadsheets/pdf_directory/1916.07.12.a-b-stillwell-fisher.pdf | 2 | < 0.1% |
| beach.pdf | 2 | < 0.1% |
| bay.pdf | 2 | < 0.1% |
| harbor.pdf | 2 | < 0.1% |
| http://sharkattackfile.net/spreadsheets/pdf_directory/1921.11.27.a-b-jack.pdf | 2 | < 0.1% |
| Other values (6133) | 6143 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 44512 | 9.4% |
| e | 41142 | 8.7% |
| / | 30455 | 6.4% |
| a | 30022 | 6.4% |
| r | 27862 | 5.9% |
| s | 26466 | 5.6% |
| . | 26139 | 5.5% |
| d | 25535 | 5.4% |
| p | 24755 | 5.2% |
| h | 19488 | 4.1% |
| Other values (63) | 176218 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 339661 | |
| Other Punctuation | 62717 | 13.3% |
| Decimal Number | 48285 | 10.2% |
| Uppercase Letter | 8683 | 1.8% |
| Dash Punctuation | 6938 | 1.5% |
| Connector Punctuation | 6212 | 1.3% |
| Space Separator | 98 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 44512 | |
| e | 41142 | |
| a | 30022 | |
| r | 27862 | |
| s | 26466 | |
| d | 25535 | 7.5% |
| p | 24755 | 7.3% |
| h | 19488 | 5.7% |
| f | 18618 | 5.5% |
| i | 15227 | 4.5% |
| Other values (16) | 66034 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 861 | 9.9% |
| S | 782 | 9.0% |
| B | 748 | 8.6% |
| C | 654 | 7.5% |
| M | 632 | 7.3% |
| N | 519 | 6.0% |
| D | 467 | 5.4% |
| H | 423 | 4.9% |
| P | 394 | 4.5% |
| A | 353 | 4.1% |
| Other values (16) | 2850 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12962 | |
| 1 | 10255 | |
| 2 | 5896 | |
| 9 | 5805 | |
| 8 | 2708 | 5.6% |
| 6 | 2383 | 4.9% |
| 7 | 2155 | 4.5% |
| 5 | 2117 | 4.4% |
| 3 | 2068 | 4.3% |
| 4 | 1936 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 30455 | |
| . | 26139 | |
| : | 6091 | 9.7% |
| ' | 25 | < 0.1% |
| , | 3 | < 0.1% |
| & | 2 | < 0.1% |
| # | 1 | < 0.1% |
| ! | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6938 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 6212 |
Space Separator
| Value | Count | Frequency (%) |
| 98 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 348344 | |
| Common | 124250 | 26.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 44512 | |
| e | 41142 | |
| a | 30022 | |
| r | 27862 | 8.0% |
| s | 26466 | 7.6% |
| d | 25535 | 7.3% |
| p | 24755 | 7.1% |
| h | 19488 | 5.6% |
| f | 18618 | 5.3% |
| i | 15227 | 4.4% |
| Other values (42) | 74717 |
Common
| Value | Count | Frequency (%) |
| / | 30455 | |
| . | 26139 | |
| 0 | 12962 | |
| 1 | 10255 | 8.3% |
| - | 6938 | 5.6% |
| _ | 6212 | 5.0% |
| : | 6091 | 4.9% |
| 2 | 5896 | 4.7% |
| 9 | 5805 | 4.7% |
| 8 | 2708 | 2.2% |
| Other values (11) | 10789 | 8.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 472594 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 44512 | 9.4% |
| e | 41142 | 8.7% |
| / | 30455 | 6.4% |
| a | 30022 | 6.4% |
| r | 27862 | 5.9% |
| s | 26466 | 5.6% |
| . | 26139 | 5.5% |
| d | 25535 | 5.4% |
| p | 24755 | 5.2% |
| h | 19488 | 4.1% |
| Other values (63) | 176218 |
href
Text
| Distinct | 6076 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 801.9 KiB |
Length
| Max length | 135 |
|---|---|
| Median length | 131 |
| Mean length | 77.731331 |
| Min length | 34 |
Characters and Unicode
| Total characters | 473617 |
|---|---|
| Distinct characters | 71 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6062 ? |
|---|---|
| Unique (%) | 99.5% |
Sample
| 1st row | http://sharkattackfile.net/spreadsheets/pdf_directory/http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.11-Goff.pdf |
|---|---|
| 2nd row | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.10.b-Flinders.pdf |
| 3rd row | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.10.a-Brock.pdf |
| 4th row | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.07.R-Thomson.pdf |
| 5th row | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.04-Simpson.pdf |
| Value | Count | Frequency (%) |
| 21 | 0.3% | |
| 4 | 0.1% | |
| http://sharkattackfile.net/spreadsheets/pdf_directory/w014.01.25-grant.pdf | 4 | 0.1% |
| boat.pdf | 3 | < 0.1% |
| http://sharkattackfile.net/spreadsheets/pdf_directory/2014.10.02.b-vandenberg.pdf | 3 | < 0.1% |
| fisherman.pdf | 3 | < 0.1% |
| http://sharkattackfile.net/spreadsheets/pdf_directory/1934.12.23.a-b-inman.pdf | 2 | < 0.1% |
| crew.pdf | 2 | < 0.1% |
| http://sharkattackfile.net/spreadsheets/pdf_directory/1916.07.12.a-b-stillwell-fisher.pdf | 2 | < 0.1% |
| bay.pdf | 2 | < 0.1% |
| Other values (6129) | 6142 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 44649 | 9.4% |
| e | 41258 | 8.7% |
| / | 30549 | 6.5% |
| a | 30097 | 6.4% |
| r | 27943 | 5.9% |
| s | 26538 | 5.6% |
| . | 26147 | 5.5% |
| d | 25590 | 5.4% |
| p | 24805 | 5.2% |
| h | 19545 | 4.1% |
| Other values (61) | 176496 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 340541 | |
| Other Punctuation | 62834 | 13.3% |
| Decimal Number | 48291 | 10.2% |
| Uppercase Letter | 8677 | 1.8% |
| Dash Punctuation | 6940 | 1.5% |
| Connector Punctuation | 6232 | 1.3% |
| Space Separator | 102 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 44649 | |
| e | 41258 | |
| a | 30097 | |
| r | 27943 | |
| s | 26538 | |
| d | 25590 | 7.5% |
| p | 24805 | 7.3% |
| h | 19545 | 5.7% |
| f | 18649 | 5.5% |
| i | 15267 | 4.5% |
| Other values (16) | 66200 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 859 | 9.9% |
| S | 782 | 9.0% |
| B | 750 | 8.6% |
| C | 653 | 7.5% |
| M | 631 | 7.3% |
| N | 519 | 6.0% |
| D | 467 | 5.4% |
| H | 423 | 4.9% |
| P | 393 | 4.5% |
| A | 352 | 4.1% |
| Other values (16) | 2848 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12967 | |
| 1 | 10258 | |
| 2 | 5894 | |
| 9 | 5808 | |
| 8 | 2707 | 5.6% |
| 6 | 2380 | 4.9% |
| 7 | 2154 | 4.5% |
| 5 | 2121 | 4.4% |
| 3 | 2066 | 4.3% |
| 4 | 1936 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 30549 | |
| . | 26147 | |
| : | 6110 | 9.7% |
| ' | 24 | < 0.1% |
| & | 2 | < 0.1% |
| , | 2 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6940 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 6232 |
Space Separator
| Value | Count | Frequency (%) |
| 102 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 349218 | |
| Common | 124399 | 26.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 44649 | |
| e | 41258 | |
| a | 30097 | |
| r | 27943 | 8.0% |
| s | 26538 | 7.6% |
| d | 25590 | 7.3% |
| p | 24805 | 7.1% |
| h | 19545 | 5.6% |
| f | 18649 | 5.3% |
| i | 15267 | 4.4% |
| Other values (42) | 74877 |
Common
| Value | Count | Frequency (%) |
| / | 30549 | |
| . | 26147 | |
| 0 | 12967 | |
| 1 | 10258 | 8.2% |
| - | 6940 | 5.6% |
| _ | 6232 | 5.0% |
| : | 6110 | 4.9% |
| 2 | 5894 | 4.7% |
| 9 | 5808 | 4.7% |
| 8 | 2707 | 2.2% |
| Other values (9) | 10787 | 8.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 473617 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 44649 | 9.4% |
| e | 41258 | 8.7% |
| / | 30549 | 6.5% |
| a | 30097 | 6.4% |
| r | 27943 | 5.9% |
| s | 26538 | 5.6% |
| . | 26147 | 5.5% |
| d | 25590 | 5.4% |
| p | 24805 | 5.2% |
| h | 19545 | 4.1% |
| Other values (61) | 176496 |
Case Number.1
Text
| Distinct | 6077 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 402.5 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 10 |
| Mean length | 10.61339 |
| Min length | 6 |
Characters and Unicode
| Total characters | 64678 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6060 ? |
|---|---|
| Unique (%) | 99.4% |
Sample
| 1st row | 2017.06.11 |
|---|---|
| 2nd row | 2017.06.10.b |
| 3rd row | 2017.06.10.a |
| 4th row | 2017.06.07.R |
| 5th row | 2017.06.04 |
| Value | Count | Frequency (%) |
| 1923.00.00.a | 2 | < 0.1% |
| 1913.08.27.r | 2 | < 0.1% |
| b | 2 | < 0.1% |
| 2013.10.05 | 2 | < 0.1% |
| 1954.00.00 | 2 | < 0.1% |
| g | 2 | < 0.1% |
| 1966.12.26 | 2 | < 0.1% |
| 2012.09.02.b | 2 | < 0.1% |
| 1962.06.11.b | 2 | < 0.1% |
| 1952.08.04 | 2 | < 0.1% |
| Other values (6070) | 6081 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 14087 | |
| 0 | 12991 | |
| 1 | 10238 | |
| 2 | 5885 | |
| 9 | 5800 | |
| 8 | 2705 | 4.2% |
| 6 | 2381 | 3.7% |
| 7 | 2151 | 3.3% |
| 5 | 2110 | 3.3% |
| 3 | 2066 | 3.2% |
| Other values (25) | 4264 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 48262 | |
| Other Punctuation | 14092 | 21.8% |
| Lowercase Letter | 1525 | 2.4% |
| Uppercase Letter | 757 | 1.2% |
| Dash Punctuation | 32 | < 0.1% |
| Space Separator | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 637 | |
| b | 623 | |
| c | 131 | 8.6% |
| d | 52 | 3.4% |
| e | 29 | 1.9% |
| f | 17 | 1.1% |
| g | 11 | 0.7% |
| h | 8 | 0.5% |
| j | 5 | 0.3% |
| i | 5 | 0.3% |
| Other values (5) | 7 | 0.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12991 | |
| 1 | 10238 | |
| 2 | 5885 | |
| 9 | 5800 | |
| 8 | 2705 | 5.6% |
| 6 | 2381 | 4.9% |
| 7 | 2151 | 4.5% |
| 5 | 2110 | 4.4% |
| 3 | 2066 | 4.3% |
| 4 | 1935 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 14087 | |
| & | 2 | < 0.1% |
| / | 2 | < 0.1% |
| , | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 518 | |
| D | 119 | 15.7% |
| N | 119 | 15.7% |
| T | 1 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 32 |
Space Separator
| Value | Count | Frequency (%) |
| 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 62396 | |
| Latin | 2282 | 3.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 637 | |
| b | 623 | |
| R | 518 | |
| c | 131 | 5.7% |
| D | 119 | 5.2% |
| N | 119 | 5.2% |
| d | 52 | 2.3% |
| e | 29 | 1.3% |
| f | 17 | 0.7% |
| g | 11 | 0.5% |
| Other values (9) | 26 | 1.1% |
Common
| Value | Count | Frequency (%) |
| . | 14087 | |
| 0 | 12991 | |
| 1 | 10238 | |
| 2 | 5885 | |
| 9 | 5800 | |
| 8 | 2705 | 4.3% |
| 6 | 2381 | 3.8% |
| 7 | 2151 | 3.4% |
| 5 | 2110 | 3.4% |
| 3 | 2066 | 3.3% |
| Other values (6) | 1982 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 64678 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 14087 | |
| 0 | 12991 | |
| 1 | 10238 | |
| 2 | 5885 | |
| 9 | 5800 | |
| 8 | 2705 | 4.2% |
| 6 | 2381 | 3.7% |
| 7 | 2151 | 3.3% |
| 5 | 2110 | 3.3% |
| 3 | 2066 | 3.2% |
| Other values (25) | 4264 | 6.6% |
Case Number.2
Text
| Distinct | 6078 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 402.5 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 10 |
| Mean length | 10.613718 |
| Min length | 6 |
Characters and Unicode
| Total characters | 64680 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6062 ? |
|---|---|
| Unique (%) | 99.5% |
Sample
| 1st row | 2017.06.11 |
|---|---|
| 2nd row | 2017.06.10.b |
| 3rd row | 2017.06.10.a |
| 4th row | 2017.06.07.R |
| 5th row | 2017.06.04 |
| Value | Count | Frequency (%) |
| 1923.00.00.a | 2 | < 0.1% |
| 1990.05.10 | 2 | < 0.1% |
| 2 | < 0.1% | |
| b | 2 | < 0.1% |
| 2009.12.18 | 2 | < 0.1% |
| 1954.00.00 | 2 | < 0.1% |
| 2013.10.05 | 2 | < 0.1% |
| 2014.08.02 | 2 | < 0.1% |
| 1915.07.06.a.r | 2 | < 0.1% |
| 2006.09.02 | 2 | < 0.1% |
| Other values (6071) | 6081 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 14089 | |
| 0 | 12992 | |
| 1 | 10236 | |
| 2 | 5888 | |
| 9 | 5798 | |
| 8 | 2706 | 4.2% |
| 6 | 2379 | 3.7% |
| 7 | 2150 | 3.3% |
| 5 | 2112 | 3.3% |
| 3 | 2066 | 3.2% |
| Other values (24) | 4264 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 48262 | |
| Other Punctuation | 14093 | 21.8% |
| Lowercase Letter | 1526 | 2.4% |
| Uppercase Letter | 757 | 1.2% |
| Dash Punctuation | 32 | < 0.1% |
| Space Separator | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 638 | |
| b | 623 | |
| c | 131 | 8.6% |
| d | 52 | 3.4% |
| e | 29 | 1.9% |
| f | 17 | 1.1% |
| g | 11 | 0.7% |
| h | 8 | 0.5% |
| j | 5 | 0.3% |
| i | 5 | 0.3% |
| Other values (5) | 7 | 0.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 12992 | |
| 1 | 10236 | |
| 2 | 5888 | |
| 9 | 5798 | |
| 8 | 2706 | 5.6% |
| 6 | 2379 | 4.9% |
| 7 | 2150 | 4.5% |
| 5 | 2112 | 4.4% |
| 3 | 2066 | 4.3% |
| 4 | 1935 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 14089 | |
| & | 2 | < 0.1% |
| , | 1 | < 0.1% |
| / | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 519 | |
| D | 119 | 15.7% |
| N | 119 | 15.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 32 |
Space Separator
| Value | Count | Frequency (%) |
| 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 62397 | |
| Latin | 2283 | 3.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 638 | |
| b | 623 | |
| R | 519 | |
| c | 131 | 5.7% |
| D | 119 | 5.2% |
| N | 119 | 5.2% |
| d | 52 | 2.3% |
| e | 29 | 1.3% |
| f | 17 | 0.7% |
| g | 11 | 0.5% |
| Other values (8) | 25 | 1.1% |
Common
| Value | Count | Frequency (%) |
| . | 14089 | |
| 0 | 12992 | |
| 1 | 10236 | |
| 2 | 5888 | |
| 9 | 5798 | |
| 8 | 2706 | 4.3% |
| 6 | 2379 | 3.8% |
| 7 | 2150 | 3.4% |
| 5 | 2112 | 3.4% |
| 3 | 2066 | 3.3% |
| Other values (6) | 1981 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 64680 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 14089 | |
| 0 | 12992 | |
| 1 | 10236 | |
| 2 | 5888 | |
| 9 | 5798 | |
| 8 | 2706 | 4.2% |
| 6 | 2379 | 3.7% |
| 7 | 2150 | 3.3% |
| 5 | 2112 | 3.3% |
| 3 | 2066 | 3.2% |
| Other values (24) | 4264 | 6.6% |
original order
Real number (ℝ)
HIGH CORRELATION  UNIFORM 
| Distinct | 6093 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3048.4997 |
| Minimum | 2 |
|---|---|
| Maximum | 6095 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.7 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 306.65 |
| Q1 | 1525.25 |
| median | 3048.5 |
| Q3 | 4571.75 |
| 95-th percentile | 5790.35 |
| Maximum | 6095 |
| Range | 6093 |
| Interquartile range (IQR) | 3046.5 |
Descriptive statistics
| Standard deviation | 1759.3311 |
|---|---|
| Coefficient of variation (CV) | 0.57711375 |
| Kurtosis | -1.1999998 |
| Mean | 3048.4997 |
| Median Absolute Deviation (MAD) | 1523.5 |
| Skewness | -5.5140017 × 10-7 |
| Sum | 18577557 |
| Variance | 3095245.8 |
| Monotonicity | Decreasing |
| Value | Count | Frequency (%) |
| 569 | 2 | < 0.1% |
| 6095 | 1 | < 0.1% |
| 2036 | 1 | < 0.1% |
| 2027 | 1 | < 0.1% |
| 2028 | 1 | < 0.1% |
| 2029 | 1 | < 0.1% |
| 2030 | 1 | < 0.1% |
| 2031 | 1 | < 0.1% |
| 2032 | 1 | < 0.1% |
| 2033 | 1 | < 0.1% |
| Other values (6083) | 6083 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 |
| Value | Count | Frequency (%) |
| 6095 | 1 | |
| 6094 | 1 | |
| 6093 | 1 | |
| 6092 | 1 | |
| 6091 | 1 | |
| 6090 | 1 | |
| 6089 | 1 | |
| 6088 | 1 | |
| 6087 | 1 | |
| 6086 | 1 |
| Fatal (Y/N) | Sex | Type | Year | original order | |
|---|---|---|---|---|---|
| Fatal (Y/N) | 1.000 | 0.000 | 0.146 | -0.337 | -0.337 |
| Sex | 0.000 | 1.000 | 0.083 | -0.149 | -0.149 |
| Type | 0.146 | 0.083 | 1.000 | 0.083 | 0.082 |
| Year | -0.337 | -0.149 | 0.083 | 1.000 | 1.000 |
| original order | -0.337 | -0.149 | 0.082 | 1.000 | 1.000 |
| Case Number | Date | Year | Type | Country | Area | Location | Activity | Name | Sex | Age | Injury | Fatal (Y/N) | Time | Species | Investigator or Source | href formula | href | Case Number.1 | Case Number.2 | original order | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2017.06.11 | 2017-06-11 | 2017.0 | Unprovoked | AUSTRALIA | Western Australia | Point Casuarina, Bunbury | Body boarding | Paul Goff | M | 48 | No injury, board bitten | N | 08h30 | White shark, 4 m | WA Today, 6/11/2017 | 2017.06.11-Goff.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.11-Goff.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.11-Goff.pdf | 2017.06.11 | 2017.06.11 | 6095.0 |
| 1 | 2017.06.10.b | 2017-06-10 | 2017.0 | Unprovoked | AUSTRALIA | Victoria | Flinders, Mornington Penisula | Surfing | female | F | NaN | No injury, knocke off board | N | 15h45 | 7 gill shark | NaN | 2017.06.10.b-Flinders.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.10.b-Flinders.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.10.b-Flinders.pdf | 2017.06.10.b | 2017.06.10.b | 6094.0 |
| 2 | 2017.06.10.a | 2017-06-10 | 2017.0 | Unprovoked | USA | Florida | Ponce Inlet, Volusia County | Surfing | Bryan Brock | M | 19 | Laceration to left foot | N | 10h00 | NaN | Daytona Beach News-Journal, 6/10/2017 | 2017.06.10.a-Brock.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.10.a-Brock.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.10.a-Brock.pdf | 2017.06.10.a | 2017.06.10.a | 6093.0 |
| 3 | 2017.06.07.R | Reported 07-Jun-2017 | 2017.0 | Unprovoked | UNITED KINGDOM | South Devon | Bantham Beach | Surfing | Rich Thomson | M | 30 | Bruise to leg, cuts to hand sustained when he hit the shark | N | NaN | 3m shark, probably a smooth hound | C. Moore, GSAF | 2017.06.07.R-Thomson.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.07.R-Thomson.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.07.R-Thomson.pdf | 2017.06.07.R | 2017.06.07.R | 6092.0 |
| 4 | 2017.06.04 | 2017-06-04 | 2017.0 | Unprovoked | USA | Florida | Middle Sambo Reef off Boca Chica, Monroe County | Spearfishing | Parker Simpson | M | NaN | Laceration to shin | N | NaN | 8' shark | Nine News, 6/7/2017 | 2017.06.04-Simpson.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.04-Simpson.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.04-Simpson.pdf | 2017.06.04 | 2017.06.04 | 6091.0 |
| 5 | 2017.06.02 | 2017-06-02 | 2017.0 | Unprovoked | BAHAMAS | New Providence | Athol Island | Snorkeling | Tiffany Johnson | F | 32 | Right forearm severed | N | Shortly before 12h00 | Tiger shark | Tribune 242, 6/2/2017 | 2017.06.02-Johnson.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.02-Johnson.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.06.02-Johnson.pdf | 2017.06.02 | 2017.06.02 | 6090.0 |
| 6 | 2017.05.30 | 2017-05-30 | 2017.0 | Provoked | USA | South Carolina | Awendaw, Charleston County | Touching a shark | Mackenzie Higgins | F | 20 | Right hand bitten by hooked shark PROVOKED INCIDENT | N | NaN | 3' shark | C. Creswell, GSAF | 2017.05.30-Higgins.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.05.30-Higgins.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.05.30-Higgins.pdf | 2017.05.30 | 2017.05.30 | 6089.0 |
| 7 | 2017.05.28 | 2017-05-28 | 2017.0 | Unprovoked | USA | Florida | Off Jupiter | Feeding sharks | Randy Jordan | M | NaN | Lacerations to right arm | N | Morning | Tiger shark | M. Michaelson, GSAF | 2017.05.28-Jordan.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.05.28-Jordan.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.05.28-Jordan.pdf | 2017.05.28 | 2017.05.28 | 6088.0 |
| 8 | 2017.05.27 | 2017-05-27 | 2017.0 | NaN | AUSTRALIA | New South Wales | Evans Head | Fishing | Terry Selwood | M | 73 | Abrasion to right forearm from pectoral fin of a shark that leapt into his boat | N | NaN | NaN | B. Myatt, GSAF | 2017.05.27-Selwood.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.05.27-Selwood.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/http://sharkattackfile.net/spreadsheets/pdf_directory/2017.05.27-Selwood.pdf | 2017.05.27 | 2017.05.27 | 6087.0 |
| 9 | 2017.05.12 | 2017-05-12 | 2017.0 | Unprovoked | UNITED ARAB EMIRATES | Sharjah, | Khor Fakkan | Spearfishing | Al Beloushi | M | 41 | Right leg severely bitten | N | Morning | NaN | Gulf News, 5/13/2017 | 2017.05.12-Beloushi.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.05.12-Beloushi.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/2017.05.12-Beloushi.pdf | 2017.05.12 | 2017.05.12 | 6086.0 |
| Case Number | Date | Year | Type | Country | Area | Location | Activity | Name | Sex | Age | Injury | Fatal (Y/N) | Time | Species | Investigator or Source | href formula | href | Case Number.1 | Case Number.2 | original order | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6085 | ND.0009 | Before 1906 | 0.0 | Unprovoked | AUSTRALIA | NaN | NaN | Fishing | boy | M | NaN | FATAL, knocked overboard by tail of shark & carried off by shark | Y | NaN | Blue pointer | NY Sun, 9/9/1906, referring to account by Louis Becke | ND-0009-boy-Australia.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0009-boy-Australia.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0009-boy-Australia.pdf | ND.0009 | ND.0009 | 10.0 |
| 6086 | ND.0008 | Before 1906 | 0.0 | Unprovoked | AUSTRALIA | NaN | NaN | Fishing | fisherman | M | NaN | FATAL | Y | NaN | Blue pointer | NY Sun, 9/9/1906, referring to account by Louis Becke | ND-0008-Fisherman2-Australia.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0008-Fisherman2-Australia.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0008-Fisherman2-Australia.pdf | ND.0008 | ND.0008 | 9.0 |
| 6087 | ND.0007 | Before 1906 | 0.0 | Unprovoked | AUSTRALIA | NaN | NaN | Fishing | fisherman | M | NaN | FATAL | Y | NaN | Blue pointers | NY Sun, 9/9/1906, referring to account by Louis Becke | ND-0007 - Fisherman-Australia.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0007 - Fisherman-Australia.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0007 - Fisherman-Australia.pdf | ND.0007 | ND.0007 | 8.0 |
| 6088 | ND.0006 | Before 1906 | 0.0 | Unprovoked | AUSTRALIA | New South Wales | Swimming | Arab boy | M | NaN | FATAL | Y | NaN | Said to involve a grey nurse shark that leapt out of the water and seized the boy but species identification is questionable | L. Becke in New York Sun, 9/9/1906; L. Schultz & M. Malin, p.523 | ND-0006-ArabBoy-Prymount.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0006-ArabBoy-Prymount.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0006-ArabBoy-Prymount.pdf | ND.0006 | ND.0006 | 7.0 | |
| 6089 | ND.0005 | Before 1903 | 0.0 | Unprovoked | AUSTRALIA | Western Australia | Roebuck Bay | Diving | male | M | NaN | FATAL | Y | NaN | NaN | H. Taunton; N. Bartlett, p. 234 | ND-0005-RoebuckBay.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0005-RoebuckBay.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0005-RoebuckBay.pdf | ND.0005 | ND.0005 | 6.0 |
| 6090 | ND.0004 | Before 1903 | 0.0 | Unprovoked | AUSTRALIA | Western Australia | NaN | Pearl diving | Ahmun | M | NaN | FATAL | Y | NaN | NaN | H. Taunton; N. Bartlett, pp. 233-234 | ND-0004-Ahmun.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0004-Ahmun.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0004-Ahmun.pdf | ND.0004 | ND.0004 | 5.0 |
| 6091 | ND.0003 | 1900-1905 | 0.0 | Unprovoked | USA | North Carolina | Ocracoke Inlet | Swimming | Coast Guard personnel | M | NaN | FATAL | Y | NaN | NaN | F. Schwartz, p.23; C. Creswell, GSAF | ND-0003-Ocracoke_1900-1905.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0003-Ocracoke_1900-1905.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0003-Ocracoke_1900-1905.pdf | ND.0003 | ND.0003 | 4.0 |
| 6092 | ND.0002 | 1883-1889 | 0.0 | Unprovoked | PANAMA | NaN | Panama Bay 8ºN, 79ºW | NaN | Jules Patterson | M | NaN | FATAL | Y | NaN | NaN | The Sun, 10/20/1938 | ND-0002-JulesPatterson.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0002-JulesPatterson.pdf | http://sharkattackfile.net/spreadsheets/pdf_directory/ND-0002-JulesPatterson.pdf | ND.0002 | ND.0002 | 3.0 |
| 6093 | ND.0001 | 1845-1853 | 0.0 | Unprovoked | CEYLON (SRI LANKA) | Eastern Province | Below the English fort, Trincomalee | Swimming | male | M | 15 | FATAL. "Shark bit him in half, carrying away the lower extremities" | Y | NaN | NaN | S.W. Baker | ND-0001-Ceylon.pdf | http://sharkattackfile.net/spreadsheets/pdf_directoryND-0001-Ceylon.pdf | http://sharkattackfile.net/spreadsheets/pdf_directoryND-0001-Ceylon.pdf | ND.0001 | ND.0001 | 2.0 |
| 6094 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |